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Cross-References to Related Applica tions 

5 The present application is related to United States Provisional Patent Application Serial 

Number 60/164,846 filed on November 10, 1999 for an Internet Radio and Broadcast Method 
which application is incorporated herein by this reference thereto. This patent application is 
related to United States Provisional Patent Application Serial Number 60/217,594 filed July 
Cl 11, 2000 for Online Playback System With Community Bias, and is a continuation-in-part of 

* io U.S. Patent Application Serial Number 09/709,234 filed November 9, 2000 for Internet Radio 

111 

Q And Broadcast Method, which applications are incorporated herein by this reference thereto. 

| BACKGROUND OF THE INVENTION 

H is Field of the Invention 

This invention relates to database generation and data stream transmission, and more 

particularly to biased data stream transmission method according to a community of 
subscribers or fans enjoying similar tastes. 

20 Description of the Related Art 

In an online environment, the demand for digital entertainment is limited by statute in 

the United States of America under the Digital Millennium Copyright Act (DMCA, Digital 
Millennium Copyright Act of 1998, Public Law 105-304). Legitimate providers of online 
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entertainment must adhere to the DMCA and pay license fees for the copyrighted works 
broadcast over the Internet or other online environment. Otherwise, such providers are liable 
for copyright infringement. 

The Digital Millennium Copyright Act (DMCA) addresses protections for copyrighted 

5 works transmitted online. The DMCA entitles websites that stream music to a statutory license 
to perform copyrighted sound recordings as long as they meet certain requirements. 
Compliance with these requirements by, among other ways: not streaming over a three-hour 
period, more than three songs or more than two in a row from the same recording, or four 
songs or more than three in a row from the same recording artist or anthology; and by 

10 transmitting songs in a noninteractive format by, for example, not allowing users to 
specifically create or request programming on demand or to hear programming at designated 
times. Additionally, compliance with the DMCA requires that advance song or artist play lists 
not be published. 

In an online environment, the content provider may "narrowcast" the data feed to a 
is single individual and still comply with the DMCA even though thousands of individual 
narrowcast transmissions are made simultaneously. For example, so long as each individual 
narrowcast does not violate the DMCA, compliance with the DMCA is maintained. 

"Narrowcasting" is a term that may be new in the art. As a contrast to "broadcasting" 
where information is broadcast on a wide basis and generally available to anyone with a tuned 
20 receiver, "narrowcasting" arises from the individually addressable data packets used in TCP/IP 
protocol. The packets are addressed to individual computers and include almost all forms of 
data transmission over the Internet. Consequently, when broadcasting occurs on the Internet, 
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it is generally composed of a bundle of narrowcast packets as each one must be individually 
addressed to the computers of the audience. This is true even though several computers are 
receiving the same content at the same time. Each computer must be individually addressed 
even though the packets are identical. When demand is high for Internet content such as a live 
i performance or transmission, bandwidth may not be sufficient for all who request 
transmission. 

Due to the nature of Internet communications and TCP/IP protocol, narrowcasting is 
one of the basic and easy ways in which to transmit information packets. Multicasting may 
also be used (See Bob Quinn, Killer Network Apps That Aren't Network Killers, Dr. Dobb's 

o Journal October 1997), but has drawbacks due to technical obstacles in effecting a multicast on 
the open Internet. Other protocols (such as FTP) also exist. 

Under the LAUNCHcast™ system (the subject of the 09/709,234 patent application 
indicated above), each subscriber may "tune" his or her narrowcast by expressing preferences 
that are recorded and preserved in an account associated with the user/subscriber. 

, 5 The LAUNCHcast™ system provides a means by which DMCA compliance can be 

maintained while biasing narrowcast transmissions according to audience/individual 
preferences. By soliciting, receiving, and recording an individual's preferences regarding (for 
example) a music data stream, LAUNCH Media, Inc. provides digital audio feed to a 
subscriber that both complies with the DMCA as well as catering to the individual's musical 

20 tastes. If the musical tastes of the individual are limited, additional music may be used to fill 
in "airtime" or "nettime" that cannot be filled with the individual's favorite songs as such 
transmission would violate the DMCA. Conversely, an individual with broad tastes could have 
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very few works transmitted in the data stream that fall outside of the individual's tastes. 

Very often, people who enjoy one type of music or artist also enjoy other types of 
music or artists so that an appearance of association between the two occur without an obvious 
causal link. For example, individuals who enjoy music by Barry Manilow might also enjoy the 
music of Neil Sedaka in a high percentage that may exceed random statistical occurrence. 
Consequently, when accompanied by a rating system or engine, individuals who enjoy Barry 
Manilow might welcome music by Neil Sedaka although they may have never heard music by 
Neil Sedaka before. 

The present invention allows enhancement of narrowcast transmission for the listener's 
or consumer's enjoyment while maintaining compliance with the DMCA. By associating 
communities of listeners/consumers around specific artists or genres, subscribers or listeners of 
an online data stream entertainment service are provided with a more focused and enjoyable 
experience as the data stream is catered to their preference by using a community bias based 
upon those who enjoy such artists, an individual artist, genres, or an individual genre. 

Note should be taken that the method described herein pertains not only to audio data 
streams, but any sort of data stream where preferences may be present, including video and 
multimedia. As entertainment data streams are particularly susceptible to strong personal 
preferences, the present invention resolves a need for providing dynamic accommodation of 
expressed preferences in a community of subscribers or listeners while complying with 
applicable copyright law. 



SUMMARY OF THE INVENTION 

The present invention provides for a biased data stream that is biased according to those 



-9774 



who prefer data streams of particular types. Using the example above, a community enjoying 
Barry Manilow could be used to bias a data stream towards both songs by Barry Manilow and 
those songs that the members of the Barry Manilow community enjoy. Consequently, through 
the use of preferences expressed by feedback of each individual member of the Barry Manilow 
community, a Barry Manilow-based radio station or data stream set emerges, Narrowcasting 
based upon such a biased data stream may then be subject to DMCA constraints so that no one 
narrowcast transmission violates the DMCA, yet the data stream transmission is biased 
according preferences expressed by the Barry Manilow community as a whole. 

The biasing of such a data stream becomes more robust and more reliable with greater 
numbers of members and when such members express a large number of preferences regarding 
the type of music they enjoy. 

Note should be taken that the term "music" as used herein is used as a shorthand for 
any data stream subject to taste or preference. Music data streams form a basic analogy from 
which all other data streams may be comparably likened, unless otherwise indicated. 
Additionally, the use of Barry Manilow as an artist of preference is arbitrary and could be 
substituted by current, modern, or classical artists such as Melissa Etheridge, Karen Carpenter, 
Rosemary Clooney, Phil Harris, Hank Williams, Led Zeppelin, Luciano Pavarotti, or Spike 
Jones. 

OBJECTS OF THE INVENTION 

It is an object of the present invention to provide more entertaining online data feeds. 
It is another object of the present to provide more entertaining data streams by 
providing a biased data stream according to a listener's/consumer's preferences. 
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It is yet another object of the present invention to provide a more entertaining data 
stream by biasing a data stream according to a community expressing preferences for 
significant components of the data stream, such as an artist or genre. 

It is yet another object of the present invention to provide a community biased music 
data stream according to a community expressing preferences for music carried by said data 
stream, such as an artist or genre. 

These and other objects and advantages of the present invention will be apparent from a 
review of the following specification and accompanying drawings. 

These and other objects and advantages of the present invention will be apparent from a 
review of the following specification and accompanying drawings. 

B RIEF DESCRIPTION OF THE DRAWING S 

Figure 1 shows an exemplary page for an artist, in this case Tori Amos. 
Figure 2 shows a similar exemplary artist page with the Fan Station option highlighted. 
Figure 3 is an isolated view of the Fan Station option shown in Figures 1 and 2. 
Figure 4 is an enlarged view of Figure 3. 

Figure 5 shows an alternative exemplary page for an artist, in this case Tori Amos. 
Figure 6 shows a similar alternative exemplary artist page with the Fan Station option 
highlighted. 

Figure 7 is an isolated view of the alternative Fan Station option shown in Figures 5 

and 6. 

Figure 8 is an enlarged view of Figure 7. 

Figure 9 is a diagrammatic view of steps taken in the present invention 
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DESCRI PTION OF THE PREFERRED EMBODIM ENT(S) 

The detailed description set forth below in connection with the appended drawings is 
intended as a description of presently-preferred embodiments of the invention and is not 
intended to represent the only forms in which the present invention may be constructed and/or 
utilized. The description sets forth the functions and the sequence of steps for constructing and 
operating the invention in connection with the illustrated embodiments. However, it is to be 
understood that the same or equivalent functions and sequences may be accomplished by 
different embodiments that are also intended to be encompassed within the spirit and scope of 
the invention. 

The present invention resides in the establishment of a community based upon shared 
musical tastes. Upon receiving and recording a statistically significant number of preferences 
and feedback regarding songs, those who prefer an artist may be distinguished from other users 
who may form a background against which fans of such an artist are distinguished. 

Using as an example the contemporary artist Tori Amos, Figures 1-8 show alternative 
commercial presentations of the present invention. As for almost all artists in its library, 
LAUNCH Media maintains home pages for artists from which users/subscribers may select 
links to additional information, including the purchase of works by the artist. As an option on 
the home page, interested individuals may select to hear an audio stream based upon the 
preferences of users who like that artist, in this case, who like Tori Amos. 

By selecting the "listen" or "watch" links in the Fan Station section of the Tori Amos 
home page (Figures 3 and 4 and Figures 7 and 8), individuals can receive data streams biased 
according to a community that likes Tori Amos. As the Tori Amos community may tend to 
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share other musical tastes, the data stream that results from the Fan Station link selection may 
also entertain the individual so selecting the link as that individual's tastes may correspond to 
the tastes of the Tori Amos community as a whole just as it did with the artist Tori Amos. 

In order to determine a community's preferences, only those individuals in the 
5 subscriber database who are "fans" of the artist are used to determine the community's 
preferences. The term "fan" may be arbitrarily defined as those individual subscribers who 
rate Tori Amos as a 70 or more on a scale of 100 with 0 being a least favorite artist and 100 
being a most favorite artist. The choice is arbitrary but needs to reflect a bias sufficient to 
entertain, or even delight, those who choose to listen to the community channel. 
io Upon determining the community of interest (Tori Amos fans, for example), collateral 

data regarding other preferences are gathered from those same individuals who are designated 
fans of Tori Amos. For example, in one embodiment, for each member of the community, all 
other rated artists besides Tori Amos are inspected. Those artists who also scored 70 or higher 
are noted and temporarily stored in a database. After all of the member accounts of the 
is community have been polled, those artists who are present in 70% of the accounts may be 
chosen as artists whose music will also be transmitted as secondary musical selections in 
narrowcast to those who choose the Tori Amos Fan Station. 

In an alternative embodiment, the collateral artists may be chosen according to 
popularity with no floor threshold (of 70% as in the embodiment above, or otherwise). In 
20 another alternative embodiment, songs rated by the community may take precedent over artist 
ratings such that individual songs are selected for narrowcast transmission from community 
preferences as opposed to portfolios of songs according to different artists (again according to 
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community preferences). 

In this way, a community may be defined and its preferences determined. Of course, 
other data streams subject to preference or taste may be substituted for the music/audio data 
stream as set forth in the example above, including video, multimedia, or otherwise. 
5 The present invention is shown diagrammatically in Figure 9. As shown in Figure 9, 

the present invention 900 provides steps for achieving the community bias system in order to 
provide data streams consistent with such community preferences. The online playback system 
55 with community bias 900 of the present invention begins first with establishing a statistically- 
O significant database 910. This database may be a database comprised of all users of a system 
3 io such as LAUNCHcast™ or the like. Such a statistically-significant database has entries with 
7 artistic preferences of the individual subscribers. Such preferences may include artists and 
St songs preferred and not preferred (liked and disliked), as well as albums that the subscribers or 
y recipients prefer or do not prefer. 

The entire subscriber community generally defines the artistic or preferential "space" in 
is which the present invention operates. Using such a geometrical point of view, certain sub- 
areas of the artistic database may then be the subject of the community preference system 900 
set forth herein. In order to achieve the present invention, certain delimitations must be made 
as to what defines a community, and the preferences expressed by the subscriber/recipient 
serve in this capacity. 

20 Statistical significance is a relative term. The goal of the present invention is to provide 

entertainment or other desired data streams to the recipients. Here, the data streams are songs 
or music videos. However, other data streams subject to subscriber databases where 

10 
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preferences are expressed for the content or type of data stream may also put to good use the 
present invention and are within the scope of the present invention and of the claims set forth 
herein. Statistical significance arises in the form of certain threshold criteria by which certain 
preferences are deliminated and/or distinguished from others. Generally, those who listen to 
5 country music may not want to also listen to heavy metal music. Those who would prefer rap 
may also like to listen to hip-hop music. Those who enjoy classical music may not enjoy 
swing or polka music. Depending upon the available databases of both subscribers and data 
streams, certain subgenres may be available such as all-Mozart or all-Beethoven community 
channels. 

10 While feedback may be obtained from the recipients of the community-biased data 

streams, generally the present invention uses the rule of thumb of approximately "70" as the 
rating threshold by which a person is considered to be a "fan" of the artist or the like. The 
"70" rating could be interpreted as indicating that the artist is in the top one-third (1/3) of the 
individual's preferred artists. By dwelling in this top 1/3 area, a community may be defined, 

is although the exact numerical criteria may depend upon the range of the "space" available for 
use in the present invention, as well as the number of subscribers and data streams. Generally, 
the broader and more numerous the original and primary database of subscribers and 
datastreams, the higher and more exclusive the threshold rating may be. 

Upon establishing a statistically-significantly database 910, certain criteria must be 

M established for determining a community's bias 920. Upon choosing that threshold, the 
statistically-significant database 910 is then filtered, sorted, or evaluated, to determine what 
trends are present with respect to fan or subscriber preferences. As mentioned above, the 
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rating of an artist of approximately above 70 on a scale of 0 - 100 is considered to be a 
relevant and significant threshold. The use of artists to define a genre or a consistent theme 
with respect to music generally arises from the fact that artists tend to write the same kind of 
music or the same type of music much in the same way as Vivaldi and Mozart had their own 
separate and distinct styles. 

Upon determining the trends in the fan community 920, a selection of individual stream 
elements may be made 930. Such stream elements are generally in conformance with two 
criteria: the community bias trends established in step 920, as well as any applicable copyright 
law. In the United States, the Digital Millenium Copyright Act (DMCA) generally controls 
such on-line transmissions of copyright works such as sound recordings and audio-visual 
works. 

The stream selection step 930 may be achieved in two modes of transmission. One 
mode would be a narrowcast mode where different individual streams are transmitted to 
different recipients who have chosen and are currently listening to a community fan station. 
Alternatively, one transmission stream could be distributed simultaneously to all current 
listeners of the fan station/community channel. Both of these transmission methods are in 
compliance with the DMCA and provide alternative means by which the present invention 900 
may be realized. 

When an individual hears a song on a community channel that he or she ("he") would 
like to rate, the rating tool may be made available to him via the player tool. The rating so 
made by the individual is then recorded in his or her preferential settings as a subscriber to the 
database 910. The user's ratings may indirectly affect the data stream selection 930 as it may 
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form part of the database used to determine the community and the stream selected for the 
corresponding channel. The user must be a fan of the artist, for example, to effect that artist's 
community channel. 

Once the stream selection process 930 has been performed, the stream is then broadcast 
5 to the recipient(s)/subscriber(s) 940. The recipients then enjoy the receipt of the data streams 
and may be exposed to new music according to their own expressed preference indicated by 
subscribing to the fan station. Consequently, an individual who likes country music and 
«1 chooses a Hank Williams community channel may be exposed to music by Porter Wagner 
H which he or she may also like. The same is similarly true for contemporary musical style such 
Ulio as rap and hip-hop, as well as musical styles developed in the past, currently under 

is : 

!L development, or to be developed in the future. 

^ In order to maintain the relevance of the community channel/fan station, the trend 

5 determining step 920, stream selection step 930 may be re-engaged after a certain period of 
time ranging from one week to several months 950. This allows for those who enjoy a certain 
is type of music to benefit from currently-popular related styles and to allow the stream selection 
process 930 to be updated to reflect current tastes. 

While the present invention has emphasized entertainment in the form of data streams 
relating to songs, sound recordings, and audio visual work such as music videos, the present 
invention is also applicable to data stream transmission systems that must comply with a 
20 regulatory scheme (such as the DMCA) in view of express preferences for content and/or type 
(such as the music individual persons like and dislike). Certain automated processes may 
benefit from the present invention, as machine-implemented processes may operate under a 

13 
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wide variety of conditions and benefit from the transmission of data streams such as 
information- and/or content-dependent data streams dependent upon a wide variety of factors, 
including geographic location, climate, other environmental conditions, or otherwise. For 
example, the data streams may be sets of suggested instructions for artificially-intelligent 
systems operating under situations requiring problem-solving abilities. 

The source code listing sets forth with particularity certain software methods by which 
one embodiment of the present invention may be achieved. The listing is believed to provide a 
full and complete disclosure of one embodiment of the present invention. 

While the present invention has been described with regards to particular embodiments, 
it is recognized that additional variations of the present invention may be devised without 
departing from the inventive concept. 

TEXT LISTING OF SOURCE CODE 

The following copyrighted source code provides a realizable embodiment of the present 
invention and is presented by way of example and not by limitation. Other source code and 
compilations thereof may implement the present invention without duplicating the following 
source code. 

package com. launch. rm.lc.SimilaritiesEngine; 

import java.util.Hashtable; 
import java.util. Enumeration; 
import java.util.Vector; 

I** 

* This class finds a bunch of items that a group of users have 

14 
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* in common that they've rated highly. The items are sorted from 

* highest group rating to lowest group rating. 

* ©author Jeff Boulter 

5 * ©author John Veilleux 
*/ 

public class Consensus 

private Hashtable contenders = new Hashtable(); 
10 private Vector finalistlDVec = new Vector(); 

private int item ID = -1; 
private int ratingCount = 0; 



a 
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jkk 

* Creates an empty consensus. 
*/ 

public Consensus() 

{ 
} 

jkk 

* Creates a consensus with an item that should be excluded from 

* the users' ratings. 

* 

* @param itemID the ID of the item to exclude 
*/ 

public Consensus( int itemID ) 
{ 

this, item ID = itemID; 

} 

* Creates a consensus where the list of items generated doesn't 

* have to exclude a specific item. 

* @param userRatings the user ratings * 
*/ 

public Consensus( Vector userRatings ) 
{ 

addRatings( userRatings ); 

} 

f** 

* Creates a consensus where the given item must be excluded 

* from the list of items that's generated. 
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* @param itemID the ID of the item to exclude 

* @param userRatings the user ratings 
*/ 

public Consensus( int itemID, Vector userRatings ) 
{ 

this.itemID = itemID; 
addRatings( userRatings ); 

} 

J** 

* Polls the group of users for their final list of items. 

* ©return the list of item ID's ordered from highest to lowest 

* group rating 
*/ 

public OrderedList poll() 
{ 

OrderedList result = new OrderedList(); 
Integer ratingltemID = null; 
GroupRating groupRating = null; 

for ( int i = finalistlDVec.size() - 1 ; i >= 0; i - ) 
{ 

ratingltemID = (Integer) finalistlDVec.elementAt( i ); 
groupRating = (GroupRating) contenders.get( ratingltemID ); 

result.add( groupRating. get(), groupRating ); 

} 

return result; 

} 

J** 

* Gets the total number of ratings within the pool of users. 
* 

* @return the rating count 

*/ 

public int getRatingCount() 
{ 

return ratingCount; 

} 

J** 

* Adds ratings to the consensus by users who will determine the 
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* final list of items. 

* @param userRatings the vector containing each user's 

* ratings 
7 

public void addRatings( Vector userRatings ) 
{ 

Rating r; 
r = null; 

for ( int i = userRatings.size() - 1 ; i >= 0; i - ) 
{ 

r = (Rating) userRatings.elementAt( i ); 

if (r.itemID != itemID) 
{ 

add( r ); 

} 

} 

} 

* Adds a rating to be used in the calculation of a contender 

* item's group rating. Once an item gets a specified minimum 

* number of ratings to calculate a group rating, it gets put 

* into the finalist list. 
7 

private void add( Rating r ) 
{ 

Integer ratingltemID = new lnteger( r.itemID ); 

GroupRating contenderGR = (GroupRating) contenders. get( ratingltemID 



if ( contenderGR == null ) 
{ 

contenderGR = new GroupRating( r.itemID ); 
contenders. put( ratingltemID, contenderGR ); 

} 

else if ( contenderGR. getNumRatings() == ( 
SimilaritiesConstants.MIN_FANS_FOR_RECOMMENDED_ITEM - 1 ) ) 

{ 

finalistlDVec.addElement( ratingltemID ); 

} 
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ratingCount ++; 
contenderGR.add( r ) 



t Patent 
01-9774 



package com. launch. rm.lc.SimilaritiesEngine; 

import com. launch. rm.lc.PlaylistGenerator.Constants; 
import java.util.Vector; 
5 import java.io.*; 

import java.sql.ResultSet; 



I** 

io * This class generates a file containing items and their similar 

* items. This is for debug purposes only; components used in the 

* calculations of the similarities are printed out. 

* @author John Veilleux 
public class DataFileGenerator 

¥ { 

1 private final static int MAX_ITEMS_TO_WRITE = 1 00; 

{ private final static int MAX_SIMILAR_ITEMS_PER_ITEM = 10; 

private final static String OUTPUT_FILENAME_ROOT = "WexportW"; 
private final static String OUTPUT_FILENAME_SUFFIX = "Similarities.txt"; 
private final static String TYPE_STRING[] = { "Artist", "Song", "Album", "Artist" }; 

static 
{ 

System. setErr( System. out ); 

} 



public static void main( String args[] ) 
{ 

Integer itemID; 
Byte itemType; 
Byte ratingType; 
SimilaritiesEngine engine; 
Vector itemlDVec; 
OrderedList groupRatingList; 
ResultSet rs; 
PrintWriter writer; 
GroupRating gRating; 
double gRatingValue; 
String headerStrl ; 
String headerStr2; 
String itemSQL; 
String ratingSQL; 
String itemParamStrl; 
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String itemParamStr2; 
String ratingParamStM ; 
String rating ParamStr2; 

itemID = null; 
itemType = null; 
ratingType = null; 
engine = null; 
itemlDVec = new Vector(); 
groupRatingList = null; 
rs = null; 
writer = null; 
gRating = null; 
gRatingValue = 0; 
headerStrl = null; 
headerStr2 = null; 
itemSQL = null; 
ratingSQL = null; 
itemParamStrl = null; 
1 20 itemParamStr2 = null; 

ratingParamStrl = null; 
ratingParamStr2 = null; 

try 

{ 

switch ( args. length ) 
{ 

case 2 

ratingType = new Byte( args[ 1 ] ); 



if ( ratingType. byteValue() < 
Constants. ITEM_TYPE_SONG || ratingType.byteValue() > 
Constants. ITEM_TYPE_ARTIST ) 

{ 

throw new Exception( "Rating type must be " 
Constants. ITEM_TYPE_SONG + ", " + Constants. ITEM_TYPE_ALBUM + ", or " + 
Constants. ITEM_TYPE_ARTIST + "." ); 

} 

40 

case 1 

itemType = new Byte( args[ 0 ] ); 
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if ( itemType.byteValueO < 
Constants. ITEM_TYPE_SONG || itemType.byteValueO > 
Constants. ITEM_TYPE_ARTIST ) 

{ 

throw new Exception( "Item type must be " + 
Constants. ITEM_TYPE_SONG + ", " + Constants. ITEM_TYPE_ALBUM + ", or " + 
Constants. ITEM_TYPE_ARTIST + "." ); 

} 

break; 
default 

throw new lnstantiationException(); 

} 

if ( ratingType != null && itemType.byteValueO == 
ratingType.byteValue() ) 
{ 

throw new Exception( "Item type cannot be equal to rating 



type." ); 



} 

Debugger.out( "DataFileGenerator started." ); 
Debugger.resetTimer( "DataFileGenerator" ); 

switch ( itemType.intValue() ) 
{ 

case Constants. ITEM_TYPE_SONG 

itemSQL = "exec sp_lcGetSongDetail_xsxx "; 
itemParamStrl = "title"; 
itemParamStr2 = "artist"; 

break; 

case Constants. ITEM_TYPE_ALBUM 

itemSQL = "exec sp_lcGetAlbumDetail_xsxx "; 
itemParamStrl = "albumName"; 
itemParamStr2 = "artistName"; 

break; 

case Constants. ITEM TYPE ARTIST 
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40 



itemSQL = "exec sp_lcGetArtistlnfo_xsxx "; 
itemParamStrl = "artist"; 

break; 

} 

if ( ratingType == null ) 
{ 

engine = new SimilaritiesEngine( itemType.byteValueO, 
MAXJTEMS J"0_WRITE ); 

writer = new PrintWriter( new FileWriter( 
OUTPUT_FILENAME_ROOT + TYPE_STRING[ itemType.intValue() ] + 
OUTPUT__FILENAME_SUFFIX ) ); 

headerStrl = TYPE_STRING[ itemType.intValueQ ] + "s 



15 similar to ("; 

i 



} 

else 

{ 



ratingSQL = itemSQL; 
ratingParamStii = itemParamStrl; 
rating ParamStr2 = item Para mStr2; 



engine = new SimilaritiesEngine( itemType.byteValueO, 
ratingType. byteValueO, MAX_ITEMS_TO_WRITE ); 
"1 writer = new PrintWriter( new FileWriter( 

4 5 OUTPUTFILENAMEROOT + TYPE_STRING[ itemType.intValue() ] + 

TYPE_STRING[ ratingType.intValue() ] + OUTPUT_FILENAME_SUFFIX ) ); 
-f headerStrl = TYPE_STRING[ ratingType. intValue() ] + "s 

1 similar to " + TYPE_STRING[ itemType.intValueQ ] + " ("; 



switch ( ratingType. intValue() ) 
{ 

case Constants. ITEM_TYPE_SONG 

ratingSQL = "exec sp_lcGetSongDetail_xsxx " 
rating ParamStrl = "title"; 
ratingParamStr2 = "artist"; 

break; 

case Constants. ITEM TYPE ALBUM 

ratingSQL = "exec sp_lcGetAlbumDetail_xsxx 

ratingParamStrl = "albumName"; 
ratingParamStr2 = "artistName"; 
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break; 

case Constants. ITEM_TYPE_ARTIST 

ratingSQL = "exec sp_lcGetArtistlnfo_xsxx "; 
ratingParamStrl = "artist"; 

break; 

} 

} 

itemlDVec = engine.getltemlDs(); 

for ( int i = 0; i < itemlDVec.size(); i ++ ) 
{ 

itemID = (Integer) itemlDVec.elementAt( i ); 

headerStr2 = headerStrl + item ID + ") "; 

rs = DBConnection.executeSQL( itemSQL + itemID, false ); 



itemParamStr2 ); 



30 



i2o if ( rs.next() ) 

{ 

headerStr2 += rs.getString( itemParamStrl ); 

if ( itemParamStr2 != null ) 
{ 

headerStr2 += " by " + rs.getString( 

} 

} 

rs.close(); 

writer.println( headerStr2 ); 

35 groupRatingList = engine.getSimilar( itemID, 

MAX_SIMII_AR_ITEMS_PER_ITEM ); 

for ( int j = 0; j < groupRatingl_ist.size(); j ++ ) 
{ 

40 gRating = (GroupRating) groupRatingList.elementAt( j 

); 

gRatingValue = groupRatingl_ist.valueAt( j ); 
writer.print( "\t" + gRating.toBigStringQ + "\t" ); 
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30 



gRating, false ); 



ratingParamStr2 ) ); 



rs = DBConnection.executeSQL( ratingSQL + 

if ( rs.next() ) 
{ 

writer. print( rs.getString( ratingParamStrl ) ); 

if ( ratingParamStr2 != null ) 
{ 

writer.print( "\t" + rs.getString( 

} 

} 

rs.close(); 
writer. printlnQ; 



} 

writer.printlnQ; 



2 Debugger.out( "Generated " + groupRatingList.size() + " 

7 similarities for item " + itemID ); 
O } 



done." ); 



type]" ); 



writer.close(); 

Debugger.outTimerMIN( "DataFileGenerator", "DataFileGenerator 

} 

catch ( Instantiation Exception ie ) 

{ 

System. out.println(); 
System. outprintln( "usage:" ); 

System. out. println( " java DataFileGenerator [item type]" ); 
System.out.printlnj " java DataFileGenerator [item type] [rating 

} 

catch ( Exception e ) 
{ 

e.printStackTrace(); 

} 
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package com. launch. rm.lc.SimilaritiesEngine; 

import com.inet.tds.TdsDriver; 
import com. launch. rm.lc.PlayiistGenerator. Constants; 
5 import java.sql.*; 
import java.util.*; 



1^1 



i si 
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J** 

* A database connection. Carries out database operations such as executing 

* SQL queries. There is only one static connection object, which can 

* create multiple statements for executing SQL and return multiple 
result sets. 



* ©author Jeff Boulter 

* ©author John Veilleux 

*/ 

public class DBConnection 

{ 

private final static String DEFAULTCONNJD = "DEFAULT"; 
private static Driver dbDriver = null; 
private static Hashtable connHash = new Hashtable(); 
private static Hashtable connectStrHash = new Hashtable(); 

static 
{ 

connectStrHash. put( DEFAULT_CONN_ID, "jdbc:inetdae: 

+ Constants. DB_SERVER 
_j_ 11.1t 

+ Constants. DB_PORT 
+ "?sql7=true&database=" 
+ Constants. DB_DBNAME 
+ "&user=" 

+ Constants.DB_USERNAME 
+ "&password=" 

+ Constants. DB_PASSWORD ); 



j-k-k 

* Adds a database connection ID and info to the pool. 

* 

* @param connlDStr the ID of the new connection 

* @param connectStr the connection info 
*/ 

public final static void addConnection( String connlDStr, String connectStr ) 
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{ 

connectStrHash.put( connlDStr, connectStr ); 

} 



* Initializes the Connection object and adds it to the pool, 

* or does nothing if the object is already initialized, 

* then returns it. 

* ©exception SQLException if a connection error occurs 

7 

private final static Connection initConnection( String connlDStr ) throws 
SQLException 
{ 

Connection conn; 
String url; 

conn = (Connection) connHash.get( connlDStr ); 
url = (String) connectStrHash.get( connlDStr ); 

if ( dbDriver == null ) 
{ 

dbDriver = new com.inet.tds.TdsDriver(); 

} 

if ( dbDriver != null && url != null && ( conn == null || conn.isClosed() ) ) 
{ 

conn = dbDriver.connect( url, null ); 
connHash.put( connlDStr, conn ); 

} 

return conn; 

} 

I** 

* Executes an SQL query. 

* 

* @param sql the query to execute 

* @param printSQL determines whether or not to print debug info 

* @return the result set for the query, or null if 

* an error occurs 
7 

public final static ResultSet executeSQL( String sql, boolean printSQL ) 
{ 
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return executeSQL( D E FAU LT_CO N N_l D , sql, printSQL ); 

} 

5 * Executes an SQL query. 

* @param sql the query to execute 

* @param printSQL determines whether or not to print debug info 

10 * ©return the result set for the query, or null if 

* an error occurs 
*/ 

public final static ResultSet executeSQL( String connlDStr, String sql, boolean 
printSQL ) 

{ 

« Connection conn; 

% ResultSet rs; 

p{ Statement st; 

pp conn = null; 

y rs = null; 

y st = null; 

CI // jf we don't have a query, don't run it-it'll hang 

%1 25 if ( sql.lengthO <= 0 ) 

51 { 

;s; System. err. println( new java.util.Date() + " 

2 DBConnection.executeSQL; can't run empty SQL query." ); 

30 return null; 

} 

if ( printSQL ) 
{ 

35 System. out.println( "Running SQL: " + sql ); 

} 

try 
{ 

40 conn = initConnection( connlDStr ); 

st = conn.createStatement(); 

st.execute( sql ); 

45 rs = st.getResultSet(); 

} 
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catch ( SQLException sqle ) 
{ 

System.err.println( new java.util.Date() + " Error running SQL: " + 

sql); 

sqle.printStackTrace(); 

} 

return rs; 

} 

J** 

* Executes an SQL update. 

* @param sql the update to execute 

* @param printSQL determines whether or not to print debug info 
*/ 

public final static void executeUpdate( String sql, boolean printSQL ) 
{ 

executeUpdate( DEFAULT CONN ID, sql, printSQL ); 

} 

J** 

* Executes an SQL update. 

* 

* @param sql the update to execute 

* @param printSQL determines whether or not to print debug info 
*/ 

public final static void executeUpdate( String connlDStr, String sql, boolean 
printSQL ) 
{ 

Connection conn; 
Statement st; 

conn = null; 
st = null; 

// if we don't have a query, don't run It — it'll hang 

if ( sql.lengthO <= 0 ) 

{ 

System.err.println( new Java. util.Date() + " 
DBConnection.executeUpdate: can't run empty SQL query." ); 

return; 

} 

if ( printSQL ) 
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{ 

System.out.println( "Running SQL: " + sql ); 

} 

try 
{ 

conn = initConnection( connlDStr ); 
st = conn.createStatement(); 

st.executeUpdate( sql ); 

} 

catch ( SQLException sqle ) 
{ 

System. err.println( new java.util.Date() + " Error running SQL: " + 

sql); 

sqle.printStackTrace(); 

} 

} 

20 / 

* Gets a DBPreparedStatement object given an SQL query. 

* @param sql the query to prepare 

* 

25 * @return the prepared statement 

* ©exception SQLException if a database error occurs 

7 

public final static PreparedStatement prepareStatement( String sq! ) throws 
so SQLException 

{ 

return prepareStatement( DEFAULT CONNJD, sql ); 

} 

35 / 

* Gets a DBPreparedStatement object given an SQL query. 

* @param sql the query to prepare 

40 * @return the prepared statement 

* ©exception SQLException if a database error occurs 
7 

public final static PreparedStatement prepareStatement( String connlDStr, String 
45 sql ) throws SQLException 
{ 
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PreparedStatement ps; 
Connection conn; 

ps = null; 

conn = initConnection( connlDStr ); 

if ( conn != null ) 
{ 

ps = conn.prepareStatement( sql ); 

} 

return ps; 

} 

J** 

* Closes a single database connection. It is removed from 

* the pool of connections. 

* @param connlDStr the connection ID 
*/ 

public final static void closeConnection( String connlDStr ) 
{ 

Connection conn; 

conn = (Connection) connHash.get( connlDStr ); 

try 
{ 

connHash.remove( connlDStr ); 

if ( conn != null ) 
{ 

conn.close(); 

} 

catch ( Exception e ) 
{ 

e.printStackTrace(); 

} 

} 

j-k-k 

* Closes all database connections in the pool. 
*/ 

public final static void closeAIIConnections() 
{ 
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Connection conn; 
String connlDStr; 

conn = null; 
connlDStr = null; 

for ( Enumeration enum = connHash.keys(); enum.hasMoreElements(); ) 
{ 

try 

{ 

connlDStr = (String) enum.nextElement(); 
conn = (Connection) connHash.get( connlDStr); 

connHash.remove( connlDStr ); 
conn.close(); 

} 

catch ( Exception e ) 
{ 

e.printStackTrace(); 

} 

} 
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package com. launch. rm.lc.SimilaritiesEngine; 

import java.util.Hashtable; 
import java.io.*; 



I** 

* This class handles all debugging functions, such as debug output, 

* for the SimilaritiesEngine package. 

* 

* ©author John Veilleux 
*/ 

public class Debugger 

{ 

private static Hashtable timerHash = new Hashtable(); 
private static PrintStream outStream = new PrintStream( System. out ); 

static 

if ( SimiiaritiesConstants. DEBUG && SimilaritiesConstants.LOGFILE ) 

{ 

try 

outStream = new PrintStream( new FileOutputStream( 

25 "SimilaritiesLog.txt" ) ); 

} 

catch ( Exception e ) 

System.err.println( "Could not create log file.. .debug info will 
30 be printed to standard out." ); 

} 

} 

} 



* Outputs the given message if debug mode is on. 

* 

* @param message the message to print 
*/ 

public final static void out( String message ) 
{ 

if ( SimilaritiesConstants.DEBUG ) 
{ 

outStream. println( "DEBUGGER: " + message ); 

} 
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} 

j-kie 

* Outputs the given message with the current timer value in 
5 * both milliseconds and minutes if debug mode is on. 

*@param timerKey the timer ID 

* @param message the message to print 
*/ 

10 public final static void outTimer( Object timerKey, String message ) 

{ 

if ( SimilaritiesConstants. DEBUG ) 
{ 

if ( timerHash.get( timerKey ) != null ) 
{ 

outStream.println( "DEBUGGER (" + getTimerMS( timerKey 
) + " MS or " + getTimerMIN( timerKey ) + " MIN): " + message ); 
} 

else 

20 { 

outStream.println( "DEBUGGER (NO TIMER FOUND): " + 

message ); 

} 

} 

} 

* Outputs the given message with the current timer value in 

* milliseconds if debug mode is on. 

* 

30 

* @param timerKey the timer ID 

* @param message the message to print 
*/ 

public final static void outTimerMS( Object timerKey, String message ) 
{ 

if ( SimilaritiesConstants.DEBUG ) 
{ 

if ( timerHash.get( timerKey ) != null ) 

40 ^ outStream.println( "DEBUGGER (" + getTimerMS( timerKey 

) + " MS): " + message ); 

} 

else 
{ 

outStream.println( "DEBUGGER (NO TIMER FOUND): " + 

message ); 
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} 

} 

} 

I** 

* Outputs the given message with the current timer value in 

* minutes if debug mode is on. 

* 

* @param timerKey the timer ID 

* @param message the message to print 

7 

public final static void outTimerMIN( Object timerKey, String message ) 
{ 

if ( SimilaritiesConstants.DEBUG ) 
{ 

if ( timerHash.get( timerKey ) != null ) 
{ 

outStream.println( "DEBUGGER (" + getTimerMIN( timerKey 

) + " MIN):" + message ); 

} 

else 
{ 

outStream.println( "DEBUGGER (NO TIMER FOUND): " + 

message ); 

} 

} 

} 

jkk 

* Resets the timer. 

* 

* @param timerKey the timer ID 
7 

public final static void resetTimer( Object timerKey ) 
{ 

timerHash.put( timerKey, new Long( System.currentTimeMillis() ) ); 

} 

* Gets the timer's current value in milliseconds. 

* 

* @param timerKey the timer ID 

* 

* ©return the timer's value in milliseconds 
*/ 

public final static long getTimerMS( Object timerKey ) 
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{ 

Long timerMS; 

timerMS = (Long) timerHash.get( timerKey ); 

5 

return System. currentTimeMillis() - timerMS. longValue(); 

} 

jit* 

10 * Gets the timer's current value in minutes. 

* 

*@param timerKey the timer ID 

* @return the timer's value in minutes 

*/ 

O public final static int getTimerMIN( Object timerKey ) 

f { 

Long timerMS; 

% 0 timerMS = (Long) timerHash.get( timerKey ); 

*i return (int) ( ( System.currentTimeMillis() - timerMS.IongValue() ) / 60000 

T ); ^ 

XI25 } 
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package com. launch. rm.lc.SimilaritiesEngine; 



j-kk 

5 * This class calculates the group rating for a single item. The 

* value is calculated by multiplying the total number of ratings 

* by the sum of the average of the ratings with some specified 

* offset. 

10 * @author Jeff Boulter 

* ©author John Veilleux 

7 

public class GroupRating 

{ 

CM private int itemID; 

ii private int numRatings = 0; 

] § private int ratingsSum - 0; 

W private double value = 0; 

2~ private double average = 0; 

private boolean stale = true; 

5 a i 

S&is J** 

S| * Creates a GroupRating object. 

ii * 

1^-25 

Q * @param itemlDthe item ID 

Q */ 

^ public GroupRating( int itemID ) 

{ 

30 this. itemID = itemID; 

} 

j-k-k 

* Gets the item ID associated with this group rating. 

35 

* ©return the item ID 

V 

public int getltemlD() 
{ 

40 return itemID: 

} 

j-k-k 

* Adds a rating to be used in the calculation of this object's 
45 * value. 
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* @param r the rating 
7 

public void add( Rating r ) 
{ 

numRatings ++; 
ratingsSum += rvalue; 
stale = true; 

} 

I** 

* Gets the final value of this object. If the value hasn't 

* been calculated yet, it is calculated and then returned. 

* @return this object's value 

7 

public double get() 
{ 

if ( stale ) 
{ 

calculate(); 
return value; 

} 

J** 

* Gets the number of ratings added to this object. 

* @retum the rating count 
7 

public int getNumRatings() 
{ 

return numRatings; 

} 

* Gets a String representation of this object. 

* 

* ©return the String description 

7 

public String toString() 
{ 

return String.valueOf( itemID ); 

} 
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* Gets a more complete String representation of this object. 

* ©return the String description 

7 

public String toBigString() 
{ 

return "itemID: " + itemID + ", # of ratings: " + numRatings + ", sum of 
ratings: " + ratingsSum + ", average: " + average() + ", score: " + get(); 
} 

* Gets the average value of all of this object's ratings. 

* 

* ©return the rating average 
*/ 

private double average() 
{ 

if ( stale ) 
{ 

if ( numRatings <= 0 ) 
{ 

average = 0; 

} 

else 
{ 

average = ( (double)ratingsSum ) / ( (double)numRatings ); 

} 

} 

return average; 

} 

I** 

* Calculates the value for this object. 
*/ 

private void calculate() 

{ 

value = numRatings * ( average() + 
SimilaritiesConstants.GR_AVG_OFFSET); 
stale = false; 

} 
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package com. launch. rm.lc.SimilaritiesEngine; 
import java.util.Vector; 



jkk 

* This class represents a list of Ordered Element objects. They 

* are sorted from highest to lowest value using quicksort. The 

* sorting is done on demand whenever any information contained in 
10 * this object is accessed. 

* ©author Jeff Boulter 

* ©author John Veilleux 

7 

15 public class OrderedList 

I { 

private Vector list; 

private boolean sorted = false; 



* This inner class represents an element used by OrderedList. It contains 

* two fields that are accessed directly: a value and an associated 

* object. OrderedList sorts these objects by the value field. 

7 

private class Ordered Element 

private double value; 
private Object thing; 




j-k-k 

* Creates an Ordered Element object. 

* 

* @param value this object's value 

* @param thing the object associated with the given value 
*/ 

private Ordered Element( double value, Object thing ) 
{ 

this.value = value; 
this.thing = thing; 

} 



jkit 

* Creates an OrderedList object. 
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*/ 

public Ordered List() 
{ 

list = new Vector(); 

} 

J** 

* Creates an OrderedList object with an initial size. 

* 

* @param size the initial size 
*/ 

public Orderedl_ist( int size ) 
{ 

list = new Vector( size ); 

} 

j-k-k 

* Creates an OrderedList object with an initial size and a 

* capacity increment. 

* @param size the initial size 

* @param capacitylncrement the capacity increment 
7 

public Orderedl_ist( int size, int capacitylncrement ) 
{ 

list = new Vector( size, capacitylncrement ); 

} 

* Gets the object at the specified index. 

* @param int the index position of the object 

* 

* ©return the object, or null if no object could be 

* retrieved at the given index 
*/ 

public Object elementAt( int i ) 
{ 

Object obj = null; 
OrderedElement e = null; 

if ( Isorted ) 
{ 

sort(); 

} 
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e = (OrderedElement) list.elementAt( i ); 

if ( e != null ) 
{ 

obj = e.thing; 

} 

return obj; 

} 

J** 

* Gets the value at the specified index. 

* 

* @param i the index position of the value 

* 

* ©return the value, or null if no value could be 

* retrieved at the given index 

*/ 

public double valueAt( int i ) 
{ 

double value = 0; 
OrderedElement e = null; 

if ( (sorted ) 
{ 

sort(); 

} 

e = (OrderedElement) list.elementAt( i ); 

if ( e != null ) 
{ 

value = e.value; 

} 

return value; 

} 

I** 

* Gets the number of elements in the list. 

* 

* ©return the list size 

*/ 

public int size() 
{ 

return list.sizeQ; 
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} 

I** 

* Truncates the list to the specified size. Nothing happens 

* if the list is already equal to or smaller than the given 

* size. 

* @param size the maximum size 
7 

public void trimToMaximumSize( int size ) 
{ 

if ( Isorted ) 
{ 

sort(); 

} 

if ( list.size() > size ) 
{ 

list.setSize( size ); 

} 

} 

jit* 

* Gets this list as a Vector of the objects associated with 

* each element in this list. 

* ©return the Vector of objects 
7 

public Vector asVector() 
{ 

Vector result = new Vector(); 

if ( Isorted ) 
{ 

sortO; 

} 

for ( int i = 0; i < list.size(); i ++ ) 
{ 

result.addElement( elementAt( i ) ); 

} 

return result; 

} 

J** 
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* Gets a String representation of this object. 

* 

* ©return the String description 
*/ 

public String toString() 
{ 

String result = "("; 

if ( Isorted ) 
{ 

sort(); 

} 

for ( int i = 0; i < list.size(); i ++ ) 

* result += elementAt( i ) + ", "; 
} 

result += ")"; 
return result; 

} 

jit it 

* Adds a value/object pair to the list. 
* 

* @param value the value 

* @param object the object 
*/ 

public void add( double value, Object toStore ) 
^ list.addElement( new OrderedElement( value, toStore ) ); 
sorted = false; 

} 

jit it 

* Removes an element from the list. 

* 

*@param index the index of the element to remove 
*/ 

public void removeElementAt( int index ) 
{ 

list.removeElementAt( index ); 

} 



43 



^Patent 
01-9774 



* Sorts this object. 
7 

private void sort() 
{ 

sort( list, 0, list.size() - 1 ); 
sorted = true; 

} 

10 

I* it 

* Performs quick sort on a vector. 

* @param a the vector to sort 
i 5 * @param from the starting index for the sort 

* @param to the ending index for the sort 

7 

private final static void sort( Vector a, int from, int to ) 
{ 

20 int i = from; 

int j = to; 

OrderedElement center = null; 
OrderedElement temp = null; 

25 if ( a == null || a.size() < 2 ) 

{ 

return; 

} 

so center = (OrderedElement) a.elementAt( ( from + to ) / 2 ); 

do 
{ 

while ( i < to && center.value < ( (OrderedElement) a.elementAt( 

35 ). value ) 

{ 

i ++; 

} 

40 while ( j > from && center.value > ( (OrderedElement) a.elementAt( 

j ) ).value ) 

{ 

j--; 

} 

if(i<j) 
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{ 

// swap elements 

temp = (Ordered Element) a.elementAt( i ); 

a.setElementAt( a.elementAt( j ), i ); 
a.setElementAt( temp, j ); 

} 

if(i<=j) 
{ 

i++; 
j-; 

} 

} 

while( i <= j ); 

if ( from < j ) 
{ 

sort( a, from, j ); 

} 

if ( i < to ) 
{ 

sort( a, i, to ); 

} 
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package com. launch. rm.lc.SimilaritiesEngine; 



I** 

* This class represents a rating. It includes three fields: an item 

* ID, a user ID, and a value. The fields are accessed directly. 

* ©author Jeff Boulter 

* ©author John Veilleux 
*/ 

public class Rating 

{ 

public int itemID; 
public int userlD; 
public byte value; 



I** 

* Creates a Rating object. 

* 

* @param itemID the ID of the item this rating is for 

* @param userlD the ID of the user who created the rating 

* @param value the actual rating value 
*/ 

public Rating( int itemID, int userlD, byte value ) 

{ 

this. itemID = itemID; 
this.userlD = userlD; 
this.value = value; 

} 

J** 

* Gets a String representation of this object. 

* ©return the String description 
*/ 

public String toString() 

^ return "Rating: [itemID: " + itemID + ", userlD: " + userlD + ", value: " + 

+ "]"; 
} 



35 

40 value 
} 
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package com. launch. rm.lc.SimilaritiesEngine; 
import com. launch. utils.PropertiesFileReader; 



* Constants used within the SimilaritiesEngine code. Changing 

* certain parameters can significantly change the amount of memory 

* used. For instance, each rating loaded into the engine uses about 

* 30 bytes of memory, so increasing M AX_RATI N G S_l N__EN G I N E by 1 

* million ratings could potentially use an extra 30 MB of memory. 

* Each fan under MAX_FANS_PER_iTEM uses about 23 bytes, so 

* MAX_ITEMS_TO_STORE times MAX_FANS_PERJTEM times 23 bytes gives 

* you the potential maximum amount of memory taken up by those 

* parameters. The ITEM_TO_ARTIST__CACHE__MAX_SIZE entries each use 

* up about 71 bytes of memory. A cache with 15,000 entries will 

* use about 1 MB of memory. 

* @author Jeff Boulter 

* ©author John Veilleux 

*/ 

public class SimilaritiesConstants 

{ 

private final static PropertiesFileReader pfr = new PropertiesFileReader( 
"SimilaritiesConstants. properties" ); 

private static int maxRatingslnEngine; 

private static String fileNames[] = { "", "", "", }; 

private static long updateSimilaritiesTimeMS; 

private static short maxltemsToStore[] = { 0, 0, 0, 0 }; 

private static int maxSimilarltemsPerltem; 

private static byte fanThreshold; 

private static int maxFansPerltem; 

private static int minFansForRecommendedltem; 

private static int grAvgOffset; 

private static int itemToArtistCacheMaxSize[] = { 0, 0, 0, 0 }; 
private static boolean debug; 
private static boolean logfile; 

static 
{ 

maxRatingslnEngine = pfr.getlntProperty( "MAX_RATINGSJN_ENGINE", 

30000000 ); 

fileNames[ 1 ] = pfr.getProperty( "SONG__RATINGS__FILE", 
"WexportWsongratings.txt" ); 

fileNamesf 2 ] = pfr.getProperty( !, ALBUM_RATINGS_FILE", 
"WexportWalbumratings.txt" ); 
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fileNames[ 3 ] = pfr.getProperty( "ARTISTRATINGSFILE", 
"WexportWartistratings.txt" ); 

fileNames[ 0 ] = fileNames[ 3 ]; 

updateSimilaritiesTimeMS = pfr.getLongProperty( 
"UPDATE_SIMILARITIES_TIME_MS", 1000 * 60 * 60 * 24 * 14 ); 

maxltemsToStore[ 1 ] = pfr.getShortProperty( 
"MAX_SONGS_TO_STORE", (short)15000 ); 

maxltemsToStore[ 2 ] = pfr.getShortProperty( 
"MAX_ALBUMS_TO_STORE", (short) 10000 ); 

maxltemsToStore[ 3 ] = pfr.getShortProperty( 
"MAX.ARTISTSjrCLSTORE", (short)3000 ); 

maxltemsToStore[ 0 ] = maxltemsToStore[ 3 ]; 

maxSimilarltemsPerltem = pfr.getlntProperty( 
"MAX_SIMILAR_ITEMS_PER_ITEM", 100 ); 

fanThreshold = pfr.getByteProperty( "FAN_THRESHOLD", (byte)90 ); 

maxFansPerltem = pfr.getlntProperty( "MAX_FANS_PER_ITEM", 300 ); 

minFansForRecommendedltem = pfr.getlntProperty( 
"MIN_FANS_FOR_RECOMMENDED_ITEM", 4 ); 

grAvgOffset = pfr.getlntProperty( "GR_AVG_OFFSET", -70 ); 

itemToArtistCacheMaxSize[ 1 ] = pfr.getlntProperty( 
"SONG_TO_ARTIST_CACHE_MAX_SIZE", 300000 ); 

itemToArtistCacheMaxSize[ 2 ] = pfr.getlntProperty( 
"ALBUM_TO_ARTIST_CACHE_MAX_SIZE", 150000 ); 

debug = pfr.getBooleanProperty( "DEBUG", true ); 

logfile = pfr.getBooleanProperty( "LOGFILE", false ); 

} 

// the maximum number of ratings that the engine can load without 
// running out of memory 

public final static int MAXJRATINGSJNENGINE = maxRatingslnEngine; 
// the file names for the corresponding item type 

// the array is indexed as { 0 = default (artists), 1 = songs, 2 = albums, 3 = artists 

} 

public final static String FILE_NAMES[] = fileNames; 

// the expiration time for similarities in the database 
public final static long UPDATE_SIMILARITIES_TIME_MS = 
updateSimilaritiesTimeMS; 

// the maximum number of items with similar items to be stored in the database 
// the array is indexed as { 0 = default (artists), 1 = songs, 2 = albums, 3 = artists 

} 

public final static short MAX_ITEMS_TO_STORE[] = maxltemsToStore; 
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// the maximum number of similar items to retrieve per item 
public final static int M AX_S I M I L AR J TE M S_P E R J TE M = 
maxSimilarltemsPerltem; 

// the user's minimum rating for an item to be considered a fan 
public final static byte FAN_THRESHOLD = fanThreshold; 

// maximum number of fans to get for an item 

public final static int M AX_F A N S_P E R_l TE M = maxFansPerltem; 

// the minimum number of ratings an item needs to be considered as a similar 

item 

public final static int MIN_FANS_FOR_RECOMMENDEDJTEM = 
minFansForRecommendedltem; 

// used when calculating the average part of a group rating 
public final static int G R_AVG__0 F F S ET = grAvgOffset; 

// used to determine the maximum size of the cache that maps 
//item ID's to artist I D's 

// the array is indexed as { 0 = default (artists), 1 = songs, 2 = albums, 3 = artists 

} 

public final static int ITEM„TO_ARTIST_CACHE_MAX„SIZE[] = 
itemToArtistCacheMaxSize; 



// determines whether or not to print debug output 
public final static boolean DEBUG = debug; 

// in debug mode, determines whether to print debug info to a 
// file or to the screen 

public final static boolean LOGFILE = logfile; 
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package com. launch. rm.lc.SimilaritiesEngine; 

import com. launch. rm.lc.PlaylistGenerator.*; 
import Java. util.*; 
5 import java.io.*; 
import java.sql.*; 



iy ■ 
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jie-k 

* This class represents the engine which churns out the item 

* similarities. The files from which the ratings are pulled must 

* be grouped by user. 

* 

* ©author Jeff Boulter 

* ©author John Veilleux 
*/ 

public class SimilaritiesEngine 

{ 

private byte itemType = 0; 

private Hashtable userRatingsHash = new Hashtable(); 
private Hashtable itemToFanlDsHash = new Hashtable(); 
private Hashtable itemToArtistCache = null; 
private final static String CACHE_CONN_ID = "CACHE"; 

static 
{ 

DBConnection.addConnection( CACHE_CONN_ID, ,f jdbc:inetdae:" 
+ Constants. DB^SERVER 

J H.lt 

+ Constants. DB_PORT 
+ "?sql7=true" 

+ "&database=dbLaunchProd" 
+ "&user=" 

+ Constants.DB_USERNAME 
+ "&password=" 

+ Constants.DB_PASSWORD ); 



J** 

* Creates a SimilaritiesEngine object. 

* 

* @param itemType the item type for which similarities will 

* be generated 

* 

* @param numltems the number of items that will have 
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* similarities generated for them 

7 

public SimilaritiesEngine( byte itemType, int numltems ) 
{ 

IntHash itemsToExciude; 

LineNumberReader reader; 

String line; 

StringTokenizer st; 

int itemID; 

int userlD; 

byte rating; 

Vector userRatings; 

int lastUserlD; 

boolean lastUserWasFan; 

int randomStartLine; 

int numltemsWithMaxFans; 

boolean allFansLoaded; 

Vector fan I DsVec; 

int numFileRatings[]; 

int portionToLoad; 

int totalRatingsLoaded; 

this. itemType = itemType; 
itemsToExciude = null; 
reader = null; 
line = null; 
st = null; 
itemID = 0; 
useriD = 0; 
rating = 0; 
userRatings = null; 
lastUserlD = -1; 
lastUserWasFan = false; 
randomStartLine = 0; 
numltemsWithMaxFans = 0; 
allFansLoaded = false; 
fanlDsVec = null; 

itemToArtistCache = new Hashtable( 
SimilaritiesConstants.lTEM__TO_ARTIST_CACHE_MAX_SIZE[ itemType ] ); 
numFileRatings = new int[]{ 0 }; 
portionToLoad = 1 ; 
totalRatingsLoaded = 0; 

try 
{ 

itemsToExciude = getltemsToExclude( itemType ); 
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Debugger.out( "There were " + itemsToExclude.size() + " items that 
already had similarities in the database and don't need to be updated yet." ); 

Debugger.out( "Now getting items with the most total ratings..." ); 
Debugger.resetTimer( "getltemsWithMostRatings" ); 

itemToFanlDsHash = getltemsWithMostRatings( itemType, 
numltems, itemsToExclude, numFileRatings ); 

Debugger.outTimer( "getltemsWithMostRatings", "Done getting 
items with the most total ratings. # of items: " + itemToFanlDsHash.size() ); 

portionToLoad = ( numFileRatings[ 0 ] / 
SimilaritiesConstants.MAX_RATINGS_IN_ENGINE ) + 1; 

randomStartLine = (int) Util.random( numFileRatings[ 0 ] ) + 1; 

reader = new LineNumberReader( new FileReader( 
SimilaritiesConstants.FILE_NAMES[ itemType ] ) ); 

Debugger.out( "Engine will load no more than 1/" + portionToLoad 
+ " of " + numFileRatings[ 0 ] + " total ratings in file." ); 

Debugger.out( "Starting to read ratings file up through random line " 
+ randomStartLine ); 

for ( int i = 1 ; i <= randomStartLine; i ++ ) 
{ 

line = reader.readLine(); 

} 



30 randomStartLine); 



Debugger.out( "Done reading file up through random line " + 

Debugger.out( "Now queuing up file to first line of next user..." ); 

line = readUpToNextUser( line, reader ); 
randomStartLine = reader.getLineNumber(); 

if ( line == null ) 

reader = new LineNumberReader( new FileReader( 
SimilaritiesConstants.FILE_NAMES[ itemType ] ) ); 

line = reader.readLine(); 

randomStartLine = reader.getLineNumber(); 

} 

Debugger.out( "Done queuing up file to first line of next user." ); 
Debugger.out( "Now loading ratings into engine..." ); 
Debugger.resetTimer( toStringQ ); 
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do 



: ~20 



if ( reader.getLineNumber() % portionToLoad == 0 ) 



{ 



lastUserlD ), userRatings ); 
userRatings.sizeQ; 



-|25 numitems; 



K> rating ) ); 



new lnteger( itemID ) ); 



st = new StringTokenizer( line, "," ); 
itemID = lnteger.parselnt( st.nextToken() ); 
userlD = Integer.parselnt( st.nextToken() ); 
rating = Byte.parseByte( stnextToken() ); 

if ( userlD != lastUserlD ) 
{ 

if ( lastUserWasFan ) 
{ 

lastUserWasFan = false; 
userRatingsHash.put( new Integer( 
totalRatingsLoaded += 

} 

lastUserlD = userlD; 

allFansLoaded = numltemsWithMaxFans == 
userRatings = new VectorQ; 



} 



userRatings. addElement( new Rating( itemID, userlD, 



if ( rating >= SimilaritiesConstants.FAN_THRESHOLD 



{ 



fanlDsVec = (Vector) itemToFanlDsHash.get( 



if ( fanlDsVec != null && fanlDsVec.size() < 
SimilaritiesConstants.MAX_FANS_PER_ITEM ) 

{ 

lastUserWasFan = true; 



userlD ) ); 



fanlDsVec.addElement( new lnteger( 
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if ( fanlDsVec.size() == 
SimilaritiesConstants.MAX_FANS_PERJTEM) 

{ 
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numltemsWithMaxFans ++; 
} 

} 

} 

} 

line = reader.readl_ine(); 

if ( line == null ) 
{ 

Debugger.out( "Read past end of " + 
SimilaritiesConstants.FILE_NAMES[ itemType ] ); 

reader. close(); 

reader = new LineNumberReader( new FileReader( 
SimilaritiesConstants.FILE_NAMES[ itemType ] ) ); 

line = reader.readLine(); 

} 

while ( lallFansLoaded && reader.getl_ineNumber() != 

randomStartLine ); 

reader.close(); 

if ( lastUserWasFan ) 

userRatingsHash.put( new lnteger( userlD ), userRatings ); 
total RatingsLoaded += userRatings. size(); 

} 

Debugger.outTimer( toString(), "Done loading " + 
totalRatingsLoaded + " ratings into engine." ); 

Debugger.out( numltemsWithMaxFans + " out of " + 
itemToFanlDsHash.size() + " items had maximum of " + 
SimilaritiesConstants.MAX_FANS_PER_ITEM + " fans." ); 

} 

catch ( Exception e ) 
{ 

e.printStackTrace(); 

} 

} 
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* Gets a sorted list of items similar to the given item. The 

* specified item ID must have been one of the candidates to 

* have similarities generated for it. 

5 

* @param itemID the ID of the item to get similar items for 

* 

* ©return the list of similar items, or an empty 

* list if the item ID wasn't included in the 
10 * similarities calculations 

*/ 

public OrderedList getSimilar( int itemID ) 
{ 

OrderedList result; 
15 Consensus c; 

Vector fan IDs; 
Vector userRatings; 

result = new Orderedl_ist(); 
20 c = new Consensus( itemID ); 

fanlDs = (Vector) itemToFanlDsHash.get( new lnteger( itemID ) ); 
userRatings = null; 

if (fanlDs != null) 
{ 

for ( int i = 0; i < fanlDs.size(); i ++ ) 
{ 

userRatings = (Vector) userRatingsHash.get( 

fanlDs. elementAt( i ) ); 

30 

c.addRatings( userRatings ); 

} 

result = c.poll(); 

if ( itemType == Constants.lTEM_TYPE_SONG ) 
{ 

removeltemsWithSameArtist( itemID, result, "exec 
sp_lcGetSongDetail_xsxx ", itemToArtistCache, 
40 SimilaritiesConstants.lTEM_TO_ARTIST_CACHE_MAX_SIZE[ itemType ] ); 

} 

else if ( itemType == Constants. ITEM_TYPE_ALBUM ) 
{ 

removeltemsWithSameArtist( itemID, result, "exec 
45 sp_lcGetAlbumDetail_xsxx ", itemToArtistCache, 

SimilaritiesConstants.lTEM_TO_ARTIST_CACHE_MAX_SIZE[ itemType ] ); 
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} 

} 

else 
{ 

5 Debugger.out( "The item with ID " + itemID + " was not one of the 

items that had similarities calculated for it." ); 

} 

return result; 

} 

* Gets a sorted list of items similar to the given item. The 

* specified item ID must have been one of the candidates to 
is * have similarities generated for it. 

* @param itemID the ID of the item to get similar items for 
CI * @param maxltems the maximum number of similar items to 

O * retrieve 

y ^ * 
2?° 

w * @return the list of similar items, or an empty 

ff % * list if the item ID wasn't included in the 

^ * similarities calculations 

h */ 

-*|25 public OrderedList getSimilar( int itemID, int maxltems ) 

O OrderedList result; 

M* result = getSimilar( itemID ); 

30 

result.trimToMaximumSize( maxltems ); 
return result; 

} 

35 

j-k-k 

* Gets all item ID's that need to have their similarities 

* generated. 

40 * ©return the item ID's 

7 

public Vector getltemlDs() 
{ 

Vector idVec; 

45 

idVec = new Vector( itemToFanlDsHash.sizeQ ); 
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for ( Enumeration e = itemToFanlDsHash.keys(); e.hasMoreElements(); ) 
{ 

idVec.addElement( e.nextElement() ); 

} 

return idVec; 

} 

10 / 

* Gets an inthash of item ID's to exclude from similarities 

* generation. 

* @param type the item type 

ass. * 
U5 

Cl * @return the item ID's to exclude 

§ 7 

Cj private final static IntHash getltemsToExclude( byte type ) 

tfso IntHash toExclude; 

Jj ResultSet rs; 

!" Timestamp lastUpdatedTime; 

%l toExclude = new IntHash(); 

hhs rs = null; 

O lastUpdatedTime = null; 

CI 

^ try 
{ 

so rs = DBConnection.executeSQL( "exec 

usp_a10xSimilar_GetAIISimilarltems_xsxx " + type, false ); 

while ( rs.next() ) 
{ 

35 lastUpdatedTime = rs.getTimestamp( "dateCreated" ); 

if ( System. currentTimeMillis() - lastUpdatedTime.getTime() 
< SimilaritiesConstants.UPDATE_SIMII_ARITIES_TIME_MS ) 

{ 

40 toExclude. increment rs.getlnt( "itemID" ) ); 

} 

} 

} 

catch ( Exception e ) 
{ 

e.printStackTrace(); 
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} 

return toExclude; 



/* 



Gets a hashtable with item ID's as the keys and an empty 

* inthash for each item. There will only be up to specified 

* maximum number of item ID'S in the hashtable, and they will 

* be chosen from most to least total ratings. 



@param 
@param 



type 

maxltems 



@param itemsToExclude 



hashtable 

* @param 



numLines 



©return 



the item type 

the maximum number of items to return 

in the hashtable 
a group of item ID's to definitely 

exclude from the returned 

a one-element array for storing the 

number of lines in the ratings file 

the hashtable of item ID's each with 
an associated inthash 



*/ 



^ |25 



30 



private final static Hashtable getltemsWithMostRatings( byte type, int maxltems, 
IntHash itemsToExclude, int numLines[] ) 
{ 

Hashtable resultHash; 
LineNumberReader reader; 
StringTokenizer st; 
int itemID; 

IntHash numRatingsHash; 
OrderedList mostRatingsltemlDList; 
int resultSize; 



resultHash = new Hashtable( maxltems ); 
reader = null; 
st = null; 
itemID = 0; 

numRatingsHash = new lntHash(); 
mostRatingsltemlDList = new OrderedList(); 
resultSize = 0; 



try 
{ 

reader = new LineNumberReader( new FileReader( 
SimilaritiesConstants.FILE_NAMES[ type ] ) ); 
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for ( String line = reader. read Line(); line != null; line = 

reader. readLine() ) 

{ 

st = new StringTokenizer( line, "," ); 
itemID = lnteger.parselnt( st.nextToken() ); 

if ( itemsToExclude.get( itemID ) == 0 ) 
{ 

numRatingsHash. increment itemID ); 

} 

} 

numl_ines[ 0 ] = reader.getl_ineNumber(); 

for ( Enumeration e = numRatingsHash.keys(); 
e.hasMoreElementsO; ) 

{ 

itemID = ( (Integer) e.nextElement() ).intValue(); 

mostRatingsltemlDList.add( (double) numRatingsHash.get( 
itemID ), new lnteger( itemID ) ); 
} 

resultSize = Math.min( mostRatingsltemlDList.size(), maxltems ); 
for ( int i = 0; i < resultSize; i ++ ) 

resultHash.put( mostRatingsltemlDList.elementAt( i ), new 



30 VectorQ ); 



} 

} 

catch ( Exception e ) 
{ 

e.printStackTrace(); 

} 

return resultHash; 



} 

J** 



Removes similar items from the given list that have the same 
artist as the given item. 

@param itemID the ID of the item whose artist should not 
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* be the same as any artists for the 

items 

* in the given list of similar items 

* @param simList the list of items similar to the given item 

5 * @param sql the sql needed for retrieving the artist ID 

* @param cache the cache with item ID's mapped to artist ID's 

* @param maxCacheSize the maximum size of the given cache 
*/ 

private final static void removeltemsWithSameArtist( int itemID, OrderedList 
10 simList, String sql, Hashtable cache, int maxCacheSize ) 
{ 

ResultSet rs; 
Integer itemlDInt; 
Integer artistID; 
15 Integer otherltemID; 

Integer otherArtistID; 

jn rs = null; 

|j itemlDInt = new lnteger( itemID ); 

y:b artistID = (Integer) cache. get( itemlDInt ); 

D otherltemID = null; 

W otherArtistID = null; 



30 



35 



try 
{ 



itemID, false ); 



if (artistID == null) 
{ 

rs = DBConnection.executeSQL( CACHE_CONN_ID, sql + 



if ( rs.next() ) 
{ 

artistID = new lnteger( rs.getlnt( "artistID" ) ); 

if ( cache. size() < maxCacheSize ) 
{ 

cache. put( itemlDInt, artistID ); 

} 

} 

else 
{ 



} 

} 



artistID = new lnteger( -1 ); 



for ( int i = simList.sizeQ - 1 ; i >= 0; i - ) 
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{ 

otherltemID = new lnteger( ( (GroupRating) 
simList.elementAt( i ) ).getltemlD() ); 

otherArtistID = (Integer) cache. get( otherltemID ); 



sql + otherltemID, false ); 



)); 



1); 



} 



if ( otherArtistID == null ) 
{ 

rs = DBConnection.executeSQL( CACHE_CONN_ID, 



if ( rs.next() ) 
{ 

otherArtistID = new lnteger( rs.getlnt( "artistID" 



if ( cache.size() < maxCacheSize ) 
{ 

cache.put( otherltemID, otherArtistID ); 

} 

} 

else 
{ 

otherArtistID = new lnteger( artistID. intValueQ 



} 

if ( artistlD.intValue() == otherArtistlD.intValue() ) 
{ 

simList.removeElementAt( i ); 

} 



} 

} 

catch ( Exception e ) 
{ 

e.printStackTrace(); 

} 



j-k-k 

* Reads through lines of a ratings file starting on the line 

* after the given line and returns the first line that has a 

* different user ID than the user ID in the given line. 



@param line the starting line 

@param readerthe object reading the ratings file 
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* ©return the first line with a different user, or null 

* if the end of the file is reached 
*/ 

5 private final static String readUpToNextUser( String line, LineNumberReader 

reader ) 
{ 

StringTokenizer st; 
int firstUserlD; 
10 int userlD; 

st = null; 
firstUserlD = 0; 
userlD = 0; 

15 

*! try 

E { 

q st = new StringTokenizer( line, "," ); 

m st.nextToken(); 

3 

^ userlD = lnteger.parselnt( st.nextToken() ); 

- firstUserlD = userlD; 

f %2 5 while ( userlD == firstUserlD ) 

? ! line = reader, read Line(); 

2 if ( line != null ) 

30 { 

st = new StringTokenizer( line, ); 
st.nextToken(); 

35 userlD = Integer.parselnt( st.nextToken() ); 

} 

else 
{ 



40 



userlD = firstUserlD - 1; 



} 



} 



} 

catch ( Exception e ) 
{ 

e.printStackTrace(); 

} 
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return line; 

} 

* Gets a String representation of this object. 

* ©return the String description 
*/ 

public String toString() 
{ 

return "Item Type: " + itemType; 

} 
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package com. launch. rm.lc.SimilaritiesEngine; 

import com. launch. rm.lc.PlaylistGenerator.*; 
import java.util.Vector; 



jkk 

* This class writes similarity data to the database. It takes the 

* item type from the command line. 

* 

* ©author John Veilleux 
*/ 

public class SimilaritiesGenerator 

{ 

* The main method. 

* @param args command line arguments 

7 

public static void main( String args[] ) 
{ 

Integer itemID; 
Byte itemType; 
SimilaritiesEngine engine; 
Vector itemlDVec; 
Vector similarlDVec; 
String sql; 

itemlD = null; 
itemType = null; 
engine = null; 
itemlDVec = new Vector(); 
similarlDVec = null; 
sql = null; 

try 
{ 

if ( args. length == 1 ) 
{ 

itemType = new Byte( args[ 0 ] ); 

} 

else 
{ 

throw new lnstantiationException(); 

} 
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if ( itemType.byteValue() < Constants. ITEM_TYPE_SONG || 
itemType.byteValue() > Constants.lTEM TYPE ARTIST ) 
{ 

throw new Exception( "Item type must be " + 
Constants. ITEM_TYPE_SONG + ", " + Constants. ITEM_TYPE_ALBUM + ", or " + 
Co n sta nts . IT E M_TYP E_ART I ST + "." ); 
} 

Debugger.out( "Similarities Generator started." ); 
Debugger.resetTimer( "SimilaritiesGenerator" ); 

engine = new SimilaritiesEngine( itemType.byteValue(), 
SimilaritiesConstants.MAX_ITEMS_TO_STORE[ itemType.intValue() ] ); 
itemlDVec = engine.getltemlDs(); 

for ( int i = 0; i < itemlDVec.size(); i ++ ) 
{ 

itemID = (lnteger)itemlDVec.elementAt( i ); 

similarlDVec = engine. getSimilar( itemlD.intValue(), 
SimilaritiesConstants.MAX_SIMILAR_ITEMS_PER_ITEM ).asVector(); 

sql = "usp_a10xSimilar_SetSimilarltems_ixxd " + itemID + " 
" + itemType + ", " + similarlDVec.size() + ", "' + Util.GetVectorAsSpaceDelimitedl_ist( 
similarlDVec ) + ; 

DBConnection.executeUpdate( sql, false ); 
Debugger.out( "Generated " + similarlDVec.size() + " 
similarities for item " + itemID ); 

} 

Debugger.outTimerMIN( "SimilaritiesGenerator", "Similarities 

Generator done." ); 
} 

catch ( InstantiationException ie ) 
{ 

System.out.println(); 
System.out.println( "usage:" ); 

System.out.println( " java SimilaritiesGenerator [item type]" ); 

} 

catch ( Exception e ) 
{ 

e.printStackTrace(); 

} 

} 

} 
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